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ABSTRACT 


'V 


Organizations  with  many  different  products  may  find  it  convenient 
to  rep! ace^d ^ioc  statistical  analyses  with  a  uniform  approach  that 

comprehends  testing  and  estimation.  Some  organi zatl ons r-1 1  ke-MRCAr— _ Q, 

find  it  imperative  to  move  in  this  direction.  This  paper  indicates  how 
an  information  theoretic  approach  via  the  MDI  (minimum  discrimination 
information)  statistic  can  be  used  for  this  purpose.  Extensions  to 
constrained  versions  of  the  MDI  statistic  also  make  it  possible  to  test 
the  consistency  of  market  information  with  management  plans  or  policies 
that  can  be  represented  in  constraints  formulated  without  reference 
to  the  data  base  and  to  estimate  their  Impact  on  the  market.  Composite 
hypotheses,  which  are  difficult  to  deal  with  by  the  more  customary 
methods  used  in  market  research,  can  be  dealt  with  naturally  and 
easily  via  the  MDI  approaches.  Basically  MDI  is  more  efficient  than 
classical  approaches  because  distribution  estimation  and  hypothesis 
testing  are  done  simultaneously.  Numerical  illustrations  are  supplied 
and  discussed  in  the  context  of  market  segmentation.  Developments  in 
statistics  and  mathematical  programming  (duality)  theory  and  methods 
are  also  briefly  examined  for  their  bearing  on  still  further  possibilities 
being  opened  for  constrained  MDI  modeling. 


Market  Segmentation 
Market  Equilibrium 
Switching  Equilibrium 
MDI  Statistic 
Composite  Hypotheses 
Mathematical  Programming 
External  Constraints 
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1.  INTRODUCTION* 


This  paper  centers  on  a  development  of  constrained  information 
theoretic  statistics  with  accompanying  algorithms  and  illustrations  for 
use  in  marketing.  It  is  pointed  toward  composite  hypothesis  testing,  as 
in  a  market  segmentation  analysis  with  explicitly  stated  constraints. 
Statistical  testing  —in  multi-way  contingency  table  analyses,  for 
instance  —  is  not  usually  undertaken  with  explicitly  stated  arrays  of 
constraints.  The  recently  published  book.  The  Information  in  Contingency 
Tables  by  Gokhale  and  Kullback,  1978,  provides  requisite  statistical 
methods  and  rationales  for  such  treatments.  This,  in  turn,  opens  the  way 
for  contact  with  other  parts  of  the  management  sciences  where  complex 
arrays  of  constraints  are  often  used  to  reflect  a  variety  of  "policy" 
and/or  "data"  conditions. 

Our  illustrations  will  be  effected  from  the  data  published  in 
Ehrenberg  and  Goodhardt,  1974,  where  these  data  were  used  to  show  that 
(a)  the  Hendry  Brand  Switching  Coefficient  approach  to  market  segmentation 
does  not  yield  very  good  estimates  of  brand  switching  behavior  and  (b)  this 
market  --  like  most  markets-^  —  is  unsegmented  (i.e.,  it  consists  of  a 
single  segment).  They  do  not,  however,  submit  their  results  to  statistical 
tests. 


*The  authors  are  indebted  to  D.  G.  Morrison  and  an  anonymous  referee 
for  editorial  suggestions  that  have  helped  to  improve  this  revision  of 
earlier  drafts. 

1/In  a  private  communication  from  G.  H.  Goodhardt  dated 

September  27,  1979,  he  states  that  it  has  been  their  experience  that 
most  markets  can  be  adequately  described  as  being  unsegmented. 


Me  shall  show  how  the  Ehrenberg-Goodhardt  results  can  be  sta¬ 
tistically  tested  relative  to  other  alternatives  by  reference  to  their 
data,  but  we  do  not  propose  to  enter  into  yet  another  discussion  of  the 
Hendry  approach  to  market  segmentation.-^  We  shall  also  steer  clear  of 
full-scale  segmentation  analysis  by  reference  to  product  benefits, 
customer  (demographic)  characteristics,  etc.,  and  treat  these  data  as 
only  a  numerical  illustration  for  the  models  and  methods  we  shall 
supply.  The  following  development  thus  provides  only  a  beginning 
but,  as  will  be  seen,  it  is  especially  well-suited  for  statistically 
testing  "nested  hypotheses"  such  as  are  implicit  in  market  segmentation 
studies.  As  we  shall  also  note  (and  illustrate),  these  constrained 
information  theoretic  models  and  methods  possess  statistical  estimation 
as  well  as  hypothesis  testing  properties  that  may  be  simultaneously 
exploited. 

2.  BACKGROUND 

Marketing  profess ionalc  are  familiar  with  the  use  of  two-way 
contingency  tables  (cross-tabulations).  Formally,  such  two-way  analyses 
can  be  extended  to  multi-way  contingency  tables,  but  then  one  confronts 
a  variety  of  inadequacies  in  classically  available  statistical  methods. 

The  unsatisfactory  nature  of  our  ability  to  deal  with  large  multi-way 
contingency  tables  has  begun  to  give  way  before  many  different  separate 
developments  in  the  statistics  and  computing  sciences  literatures.  The 
resulting  proliferation  of  methods  has  given  rise  to  a  need  for 
systematization  accompanied  by  a  unifying  rationale  based  on  methodological 

— ^See  e.g.,  the  critique  in  Ehrenberg  and  Goodhardt,  1979,  of  the 
presentation  by  Kalwani  and  Morrison,  1977. 
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as  well  as  conceptual  grounds.  The  recently  published  book.  Discrete 

Multivariate  Analysis,  by  Bishop,  Flenberg  and  Holland,  1975,  explicitly 

acknowledges  that  their  effort  was  undertaken  in  response  to  this  need.-^ 

These  authors  use  the  "log-linear  model”  as  a  basis  for  "codifying" 

approaches  that  might  be  employed.  The  approach  via  log-linear  models 

falls  short  of  what  is  required,  however,  to  supply  the  unifying  rationale 

that  is  needed.  This  is  supplied  In  Gokhale  and  Kullback,  1978a  who  use 

the  MDI  statistic  for  this  purpose  and  who  show  how  the  log-linear  model 

2/ 

itself  can  be  derived  from  this  statistic-  along  with  a  variety  of  other 
approaches.  They  also  show  how  extensions  beyond  any  of  these  other 
approaches  --  e.g.,  including  additional  constraints  beyond  those  of  the 
contingency  tables  --  can  be  effected  along  these  "Information  theoretic" 

1 i nes . 

We  start  with  what,  in  the  literature  of  mathematical  statistics, 
is  called  the  "Kullback-Leibler  statistic"-^  which  is  of  the  form 

n  Pi 

(1.1)  I  (p:iO  •  E  Pi  log  — 

i  =1  1  71  i 

where  p  and  n  are  vectors  with  components 

P.  #  IT.  >  0 

n  n 

(1.2)  2  p.  =  E  tt  =  1  . 

i =1  1  i =1  1 


1/pp.  1-3. 

2/pp.  38-39. 

3/After  Kullback  and  Lei  bier,  1951.  We  have  elsewhere  also  called  :his 
the  Khinchin-Kullback-Leibler  statistic  --  see  Charnes,  Cooper  and 
Seiford,  1978, ..because  of  the  earlier  (but  simpler)  contribution  in 
Khinchin,  1949. 


Here  the  tt.  may  represent  a  set  of  hypothesized  (constant)  values  which 
are  to  be  tested  relative  to  the  p^  (variable)  values  estimated  from  ob¬ 
served  data.  The  ^  may  also  represent  a  set  of  prior  probabilities  as 
in  Bayesian  decision  theory  and  then  the  p^  become  posterior  probabilities 
determined  from  sample  observations.  Other  interpretations  are  also  pos¬ 
sible  which  accord  with  different  ways  of  choosing  the  p^.  When  the 
components  of  p  represent  minimizing  values,  p*,  the  expression  in  (1.1) 
may  be  replaced  by 


(2) 


1  ( p* :  it  ) 


n  p,*  n  Pi 

=  I  p.*  log  —  *  min  E  p*  log  —  , 

1  =  1  1  71  i  P  i *1  1 


and  this  is  called  the  MDI  (=Minimum  Discrimination  Information)  statistic. 

We  may  explain  this  name  along  the  following  lines:  Consider  any 
estimate  of  p  in  (1.1)  which  yields  a  distribution  that  significantly  differs 
from  the  distri bution  as  hypothesized  in  tt.  If  the  p  distribution  differs 
significantly  from  the  hypothesized  tt  distribution,  might  not  some  other 
estimate  of  p  (also  consistent  with  the  data)  fail  to  exhibit  significance? 
The  problem  is  resolved  if  p  is  chosen  "as  close  as  possible"  to  tt.  This  is 
the  meaning  of  minimum  discrimination;  i.e.,  p*  provides  minimum  discrimina¬ 
tion  against  the  hypothesized  tt  that  the  data,  together  with  any  other 
constraints,  will  allow. 
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Following  the  original  Shannon-Wiener  information  theoretic  for¬ 
mulations,  the  measure  of  information  expression  (1.1)  was  originally 
regarded  as  a  probabilistic  rather  than  a  statistical  concept.-^  In  his 
poineering  book,  however,  S.  Kullback,  1959,  was  able  to  provide  a 
statistical  foundation  and  to  show  how  (1.1)  and  (2)  could  be  used  to 
unify  an  extremely  wide  variety  of  statistical  concepts  and  developments. 

Various  statisticians,  especially  in  the  Soviet  and  Japanese 

2/ 

literatures,  have  continued  to  push  forward  vigorously  along  these  paths. - 
The  Japanese  statistician,  H.  Akaike,  for  example,  presented  a  very  impor- 

3/ 

tant  paper  at  a  Soviet  sponsored  conference—  in  which  he  showed  how 
maximum  likelihood  estimation  (the  heart  of  classical  statistics)  could 
be  given  a  decision  theoretic  formulation  asymptotically  equivalent  to 
the  MDI  so  that  the  two  approaches,  maximum  likelihood  and  decision  theory, 
could  be  unified  on  one  common  basis.  The  MDI  method  has  the  unique 
characteristic  of  providing  both  hypothesis  testing  ard  statistical 
estimation  regardless  of  the  conclusion  of  the  test.  It  thus  also  provides 
a  decision  theoretic  method  with  unifies  hypothesis  testing  and  estimation. 

Regression  (sometimes  of  a  logit  or  probit  variety)  and  factor  analysis 
are  often  used  techniques  in  marketing  research.  Current  statistical 
procedures  usually  determine  the  number  of  terms  to  be  used  on  a  trial 
and  error  -  or  exhaustive  sequential  —  basis.  Akaike,  1973,  however,  shows 


i/see  p.  2  in  Kullback,  1959. 

— ^The  econometrician,  H.  Theil,  has  also  pushed  forward  vigorously 
with  a  wide  variety  of  publications  on  these  topics  in  the  American- 
English  literatures  (see  e.g.  Theil,  1976)  and  is  now  readying  a 
book  on  information  theoretic  approaches  to  estimating  statistical 
distributions.  See  Theil  (in  process). 


-See  Akaike,  1973. 
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that  the  MDI  approach  Immediately  determines  the  number  of  terms  to  be 
used  from  the  sample  data  by  the  MDI  decision  theoretic  criterion  alone. 

There  have  been  other  extensions  as  well.  Gokhale  and  Kullback,  1978, 
have  extended  contingency  table  analyses  to  include  additional  constraints. 
These  constraints  are  of  the  form 


) 


,  1  B  1 ,  2 ,  . . . ,  m 


When  the  0.  are  derived  from  the  data,  as  in  the  case  of  marginal  totals 
for  a  conti gnecy  table,  then  the  constraints  are  said  to  be  "internal 
constraints".  When  they  are  imposed  on  the  basis  of  various  hypothesized 
premises  (and  not  by  the  data),  as  in,  for  instance,  an  assumed  segmentation, 
then  the  constraints  are  said  to  be  "external  constraints."  It  is  the  latter 
which  will  be  emphasized  here  if  only  because  this  provides  new  opportunities 
for  dealing  with  managerial  plans  or  policies  that  cannot  be  accommodated  by 
internal  constraints. 

Notice  the  flexibility  that  is  allowed  by  reference  to  the  possibility 

of  testing  hypotheses  with  the  ir  values  in  the  functional  or  the  6  values  in 

the  constraints.  Choice  of  the  minimizing  p*  in  terms  of  (1.1),  (2)  and  (3) 

makes  contact  with  mathematical  programming  with  its  great  computational 

power  and  the  interpretations  that  are  available  from  the  underlying  duality 

relations.  These  prospects,  too,  have  now  been  formally  effected  as  in 

2/ 

Charnes  and  Cooper,  1974,  and  Chames,  Cooper  and  Seiford,  1 978—' with  the 


-  Akaike  also  brought  these  methods  to  bear  on  the  so-called 

"James-Stein  paradox"  wherein  by  the  use  of  seemingly  irrelevant  data 
one  can  secure  improved  estimates  which  are  not  only  more  efficient  than 
the  mean  (a  maximum  likelihood  estimator)  but  which  also  even  eliminate 
the  mean  from  the  admissible  class  of  estimators  in  a  decision  theoretic 
sense.  See  Akaike,  1977,  and  Akaike,  1979,  where  the  inadequacies  of 
Bayesian  approaches  to  this  topic  are  also  discussed.  (An  amusing  and 
insightful  article  on  the  James-Stein  paradox  may  be  found  in  Efron  and 
Morris,  1977,  and  a  more  general  treatment  of  the  deficiencies  of 
maximum  likelihood  estimators  may  be  found  in  Weiss  and  Wolfowitz,  1974.) 

£/See  also  Charnes,  Cooper  and  Tyssedal  (forthcoming). 


result  that  an  unusually  simple  (unconstrained)  convex  programming  problem 
is  found  to  be  the  dual  to  minimizing  (1.1)  subject  to  (1.2)  and  (3)-^, 
which  dual  is 

n  n 

(4)  Max  r  0,z.  -  £  it.  exp(I  z.a.,) 

z  i-0  1  1  j=i  3  1  1 

where  Qq=  1,  a0j  =  1,  j  =  1,  ...n  and  !,exp"  denotes  the  exponential  function, 

e  =  2.718 -  See  Charnes  and  Cooper,  1974,  and  Charnes,  Cooper  and  Seiford, 

1978,  and  observe  that  the  choices  of  the  components  in  z  are  not  constrained. 

There  are  other  advantages  that  can  also  be  secured.  For  example,  as  we 
have  elsewhere  shown  —  Charnes,  Cooper,  Learner  and  Phillips,  1980c  --  MDI 
can  provide  a  basis  for  unifying  a  great  variety  of  seemingly  separate  (and 
different)  approaches  to  marketing  research.  Here  we  want  to  show  how  to 
bring  it  to  bear  on  problems  such  as  the  composite  hypothesis  testing 
required  for  market  segmentation  testing. 

3.  A  SEGMENTATION  THEOREM  AND  ALGORITHM 
Although  information  theoretic  approaches  can  be  found  in  the  marketing 
literature,  the  example  applications  mainly  take  the  form  of  the  simpler 
"entropy"  formula 


n 

(5)  Z  p,  log  p. 

i  =  1  1  1 


-  l t  is  perhaps  of  interest  to  note  that  (1.1)  provides  a  "proper  go*1 
functional"  in  the  sense  of  Charnes  and  Cooper,  1977,  for  use  ir 
goal  programming. 


applied  to  areas  like  brand  switching  or  market  segmentation.  Examples  of 
such  uses  of  the  entropy  concept  in  the  marketing  literature  may  be  found 
in  the  articles  by  Herniter,  1973  and  1974,  and  by  Bass,  1974^/  Variants 
may  also  be  found  in  the  article  by  Kalwani  and  Morrison,  1977,  as  well  as 
in  other  articles  dealing  with  the  Hendry  approach  to  brand  switching  and 
market  segmentation  analysis.  See,  e.g.,  Butler  and  Butler,  1971, and 
Hendry,  1970,  1971. 

We  elect  to  make  contact  with  this  part  of  the  marketing  literature 
via  the  data  of  Table  1  which  Ehrenberg  and  Goodhardt,  1974,  used  to  test 
the  Hendry  approach.  They  concluded  that,  on  the  evidence,  this  market 
does  not  exhibit  any  segmentation  at  all In  arriving  at  this  conclusion, 
Ehrenberg  and  Goodhardt  use  a  first-order  analysis  of  the  market  shares. 

Within  a  Hendry  analysis  one  must  check  to  insure  that  the  switching  constants 
derived  from  these  ratios  "is  applicable  to  the  total  product  class  as  well 
as  to  the  individual  brands  within  a  product  class.-7  More  generally  one 
might  commence  with  the  (relative)  market  share  shown  for  each  product,  and 
then  go  on  to  consider  pairs,  followed  by  triplets,  and  so  on,  to  the 
2n-l =1 27  possibilities  for  the  example  shown  in  Table  1.  The  idea  is  to 
check  to  see  whether  the  resulting  ratios  are  approximately  the  same  in  order 
to  avoid  (or  reduce)  the  danger  of  reaching  erroneous  conclusions  from  the 
(perhaps  accidental)  equality  of  lower  order  ratios. 

Ehrenberg  and  Goodhardt  do  not  undertake  any  of  these  higher  order 
calculations,  but  their  approach  is  nonetheless  satisfactory  by  virtue  of 
the  following  result 

-^See,  also  Carter,  1979,  and  Haynes,  Phillips  and  Mohrfield,  1980. 

2/ 

-  As  elaborated  in  a  private  communication  where,  as  noted  on  p.l  in 
our  introduction,  Ehrenberg  and  Goodhardt  believe  that  most  markets 
can  be  adequately  described  as  being  unsegmented. 

3/ 

-  Quoted  from  Kalwani  and  Morrison,  1977,  p.470. 
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If  * 


Aj.  s^S^c 

Then  f°r  a11  ke  U . n)  and  all 

k  seS  s  SC{1,  ...,  n}. 


where  A^,  are  real  numbers  with  all  t  0.  In  other  words,  when  the 
first  order  ratios  are  equal  then  all  the  higher  order  ratios,  however 
they  may  be  formed,  will  also  be  equal. 

This  result,  which  we  have  proved  elsewhere,-^  is  employed  in  the 
first  part  of  our  proposed  algorithm  enabling  us  to  follow  Ehrenberg  and 
Goodhart  in  restricting  ourselves  to  first  order  calculations  in  the  use 
of  formula  (7),  below.  We  observe  that  (6)  holds  without  reference  to  the 
way  the  A<.  and  Bs  values  are  obtained  and  then  pass  on  to  an  initial 


segmentation  via 


p,  -  P(i >i ) 

R  =  _J - 

P(  (1  -  V 

where  p^  =  market  share  for  brand  i 

p(i,i)  =  proportion  making  repeat  purchases  of 

brand  i . 


-^See  Charnes,  Cooper,  Learner  and  Phillips,  1980a. 
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Our  use  of  Table  1  is  only  illustrative  and  so  we  do  not  make  a 
detailed  examination  of  the  way  these  data  were  obtained  or  treated. 
Neither  all  rows  nor  all  columns  seem  to  unity--see  Table  2—  so  that 
evidently  the  matrix  is  not  Markovian.  Even  if  they  did  sum  to  unity 
this  would  not  mean  that  the  process  is  considered  to  be  Markovian 
and  the  fact  that  Ehrenberg  has  been  openly  critical  of  the  use  of 
first-order  Markov  processes  in  describing  consumer  behavior-^ 
disinclines  us  to  treating  these  data  in  this  manner.  We  Shall 
therefore  interpret  the  p  (i,i)  as  joint  rather  than  conditional 
probabilities. 

Applying  (7)  to  the  data  of  Table  1  we  obtain  the  following 
results 


Rj  =  0.660 

R4  «  0.719 

(8) 

R2  =  0.656 

R5  =  0.750 

R3  =  0.723 

R6  =  0.704 

Using  (6)  we  then  assert  that  i  and  j  are  in  the  same  classification  when 

R.  and  R.  differ  only  because  of  sampling  errors.  Following  Ehrenberg  and 
i  J 

Goodhardt,  1974,  we  have  used  the  average  of  the  row  and  column  marginal 

2/ 

values  in  arriving  at  these  results.  This  assumes  "market  share  equilibrium"- 
between  the  two  periods,  a  hypothesis  that  is  subject  to  test  along  lines 
that  we  shall  indicate.  This,  however,  is  not  the  only  possible  equilibrium 
that  might  be  of  interest  for  marketing  purposes.  For  instance,  we  shall 
introduce  the  idea  of  "switching  equilibrium"  and  show  how  this,  too,  may 
be  tested. 


-^See  Ehrenberg,  1965,  and  his  1968  rejoinder  to  Massey  and  Morrison,  1968. 

9  • 

-See  Ehrenberg  and  Goodhardt,  1974. 
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Ehrenberg  and  Goodhardt,  1974,  conclude  from  their  analyses  that  this 
market  does  not  segment  into  different  classes.  I.e.,  they  conclude  that 
there  is  only  one  "homogeneous"  market  in  which  all  products  compete. 

They  do  not  test  this  hypothesis  by  statistical  methods.  As  a  byproduct 
of  our  use  of  a  series  of  composite  hypothesis  testing  procedures  with  an 
accompanying  algorithm,  we  shall  now  show  how  such  statistical  tests  may  be 
conducted  with  MDI  methods. 

TABLE  1 


The  Observed  Brand-Switching  Percentages 


for  Six  Brands  of  Breakfast  Cereals 

(Two  successive  purchases  per  consumer) 

_ Second  Purchase  A1 1 

Product  First  Purchase  l _ 2 _ 2 _ 4 _ 5 _ 6_  Buyers 


Corn  Flakes 

(23.8) 

7.7 

3.3 

2.0 

1.3 

1.4 

39.5 

Wee tab ix 

6.4 

(14.3) 

3.3 

1.1 

.8 

1.1 

27.0 

Shredded  Wheat 

3.6 

3.2 

(5.4) 

.8 

.8 

.7 

14.5 

Sugar  Puffs 

3.2 

1.5 

1.1 

(3.1) 

.8 

.3 

10.0 

Puffed  Wheat 

1.7 

.6 

.4 

.6 

(1.5) 

.1 

4.9 

Brand  X 

1.0 

.4 

.6 

.3 

.3 

(1.5) 

4.1 

All  Buyers 

39.7 

27.7 

14.1 

7.9 

5.5 

5.1 

100.0 
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4.  VULNERABILITY  RATIOS 

These  Rj  values  are  sometimes  referred  to  as  "switching  constants" 
—  e.g..  In  a  Hendry  analysis— but  we  shall  not  follow  that  usage  here. 
We  shall  Instead  refer  to  them  as  "vulnerability  ratios."  Justification 
for  this  change  in  terminology  may  be  presented  in  terms  of  (7)  by 
rewriting  it  in  the  following  "verbalized"  form: 

(91  1  repeat  purchase  rate  for  i 

{  >  1  brand  share  for  i 

Rj 

1  -  brand  share  for  i 

The  smaller  the  value  of  this  ratio,  the  less  vulnerable  is  the  brand 
in  the  sense  that  smaller  R.  reflect  an  increasing  proportion  of  repeat 
buyers  in  the  brand's  present  market  share.  It  is  repeat  buying  on 
which  this  measure  focuses.  For  instance,  i f  R -  58  0,  then  the  brand 
share  for  i  =  the  repeat  purchase  rate  for  i,  and  all_  of  this  brand's 
customers  are  repeat  buyers. 

We  keep  these  interpretations  general  and  restrict  our  terminology 
accordingly.  For  instance,  we  do  not  assess  the  reasons  for  the  observed 
behavior  and  we  allow  assumptions  of  a  probabilistic  character  which 
may  include  various  specialized  statistical  distributions  used  by  others. 
As  a  case  in  point  we  cite  Sabavala  and  Morrison,  1977,  who  use  a  Beta- 
Binomial  development  to  obtain  a  "Loyalty  Index"  for  viewing  TV  programs. : 


-^We  are  indebted  to  D.G.  Morrison  for  pointing  out  the  additional 

possibilities  and  relations  derivable  from  Sabavala  and  Morrison,  1977 
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In  this  case  more  special  (and  precise)  characterizations  are  possible 
since  the  condition  Ri  *  0  then  also  implies  a  bipolar  population  in  which 
consumers  all  have  purchase  probabilities  near  0  or  1  for  any  such  brand. 
Results  like  this  suggest  additional  hypotheses  for  testing,  such  as 
situations  particularized  further  within  the  situation  p^  3  p...  The 
latter  is  consistent  not  only  with  high  or  low  p..  values  but  with  inter¬ 
mediate  values  as  well. 

Another  critical  value  of  the  vulnerability  ratio  is  at  =  1  where 
we  have 

(10)  p(i  »i )  =  P^. 

If  we  interpret  this  in  terms  of  probabilities,  the  repeat  purchases  are 
statistically  independent  identically  distributed  events.  This  would  be  the 
situation  for  a  zero  order  process-^  in  which  all  consumers  have  the  same 
probability,  p^ ,  of  purchasing  brand  i  where,  as  we  may  also  note, Pi  may  be 
either  small  or  large. 

For  more  general  zero  order  processes  involving  heterogeneous 

2/ 

populations  it  has  been  shown  that  0  <  <  1.— 

-  t 

-  See  p.  51  in  Massy,  Montgomery  and  Morrison,  1970. 

?  / 

-'We  owe  this  result  to  an  anonymous  referee. 
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There  are,  of  course,  still  other  possibilities  that  may  occur  and  so 
we  should  also  allow  for  situations  in  which  >  1  as  will  occur,  for 
example.  If  p  (1,1)  *  0,  i.e..  If  there  are  no  repeat  purchases  as  in  fad 
or  fashion  items.  We  should  also  like  to  know  the  implications  of  R^  >1 
if  these  values  are  to  be  used  for  monitoring  possible  changes  in  market 
behavior. 

Now  R.  >1  if  and  only  if 

(11.1)  p<  -  p(i,i) 

J -  >  i 

p,  (i  -  Pf> 

so  that  also  >1  if  and  only  if 

(11.2)  p(i,i)  <  Pi2, 

or  the  repeat  purchase  probability  is  smaller  than  would  be  obtained  from 
statistically  independent  purchase  behavior.  Notice  that  the  emphasis  is 
on  repeat  purchases  relative  to  other  purchase  behavior. 

We  carry  this  analysis  a  step  further  and  rewrite  (11.1)  as 

(12.1)  Pi  -  P  ( i , i )  >  Pi  (1  -  P|). 

Now 

p.  =  p(i,i)  +  E  p., 

1  JJH  1J 

and 

Pj  5  1  Pr 
1  nH  r 

so  that  substitution  in  (12.1)  yields 


(12.2.) 
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However,  p^  >  p  (i  ,1 )  and  pf  >  p^  and  therefore 


(13) 


£  P.-.-  >  P  (1,1)  l 
3ti  1J  rjH 


so  that  the  probability  of  switching  out  (on  the  left)  exceeds  the  probability 
of  staying  in  times  the  probability  of  switching  in  (on  the  right). 

The  information  supplied  by  these  Ri  values  can  be  put  to  use  in 
developing  market  strategies,  with  higher  values  of  R.  being  associated 
with  more  vulnerable  products.  Additionally  these  ratios  may  direct 
attention  to  possibilities  for  attracting  customers  from  brands  with  R. 
values  exceeding  one's  own  or  of  possible  inroads  from  brands  with  smaller 
R1  values,  which  may  have  staying  power  because  of  repeat  purchasing 
propensities. 

Differences  in  observed  Ri  values  may  not  be  statistically  significant, 
in  which  case  the  above  interpretations  will  need  to  be  modified.  Testing 
for  statistical  significance  of  these  and  other  hypotheses  is  a  task  to 
which  we  shall  now  turn,  but  our  focus  on  this  aspect  of  testing  does  no 
mean  that  we  believe  that  other  considerations  such  as  application  marketing 
insight,  are  unimportant. 

5.  TYPICAL  FORMS 

We  will  use  specializations  of  (1.1)  and  (2)  and  (3)  to  illutstrate 
our  procedures  for  testing  the  structure  of  a  market  in  the  form  of  a  series 
of  "nested  hypotheses."  We  now  introduce  a  double  subscript  for  the  discrete 
probability  distributions  with  which  we  are  concerned.  The  modes  is-/ 


-^The  well-known  SUMT  program  of  Fiacco  and  McCormick,  1968,  is  available 
for  solving  this  kind  of  problem  via  its  dual. 
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linimize  KPij^ij)  =  Pij  ln  ^ 


subject  to 


where 


1 "  A  pu’  pu-0,Vl,j’ 
* 


0  =  R.  -  R.  ,  etc., 

1  J 


1  -  p(i,i)/  P4 
- 

^  .  1  -  p(j,j)/  P, 

R.  - - - - - - J 

J  1  ’  Pj 

and  the  p.  and  p.  indicate  posited  market  share  values,  i.e.,  these  values  are 

*  J 

not  to  be  estimated  by  this  minimization  procedure.  Here  p ( i , i )  =  p^.  These 
values  are  to  be  estimated  along  with  the  p. i  which  represent  proportions 
switching  between  products  i  and  j  over  the  two  purchase  occasions. 

A  A 

The  estimated  R.  =  R.  represent  vulnerability  ratios  hypothesized 

'  J 

to  accord  with  "nestings"  indicated  by  arrangements  like 

A  A 

0  -  Rj  -  Rj 

)  /s  A 

0  »  R.  -  R. 

j  k 

when  products  i,  j  and  k  are  hypothesized  to  be  in  the  same  vulnerability  class. 

To  simplify  notation  we  omit  the  circumflexes  on  the  R^ ,  Rj,  Rk  and 
designate  optimal  estimates  by 


★ 

Ri  ■ 


Pi*/Pj 


P../P4 
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The  resulting 


(18)  I*  *  I  (p^  ,  ir1  j) 

may  then  be  used  to  test  the  hypothesis  that  the  minimizing  p^  choices  do 
not  deviate  from  the  distribution  associated  with  the  ir^.  The  rr^ 
represent  "switching  proportions"  hypothesized  in  our  case  to  accord  with 
the  data  represented  in  Table  2, 

TABLE  2 


K 

1 

2 

3 

4 

5 

6 

1 

.238 

.077 

.033 

.020 

.013 

.014 

2 

.064 

.143 

.033 

.011 

.008 

.011 

3 

.036 

.032 

.054 

.008 

.008 

.007 

4 

.003 

.015 

.011 

.031 

.008 

.003 

5 

.017 

.006 

.004 

.006 

.015 

.001 

6 

.010 

.004 

.006 

.003 

.003 

.015 

where  the  ir^  data  of  Table  2  are  the  Table  1  data  expressed  as  fractions. 

★ 

The  p.^  values  are  to  be  secured  by  solving  (14)  using  a^l  of  the  data 
(off-diagonal  as  well  as  diagonal)  in  Table  2. 

A  characterization  which  provides  access  to  the  statistical  properties 

as  developed  in  Kullback,  1959,  for  the  MDI  statistic  may  be  obtained  from 
the  widely  shared  belief  that  brand  switching  is  proportional  to  brand  shared 


-  See  Ehrenberg  and  Goodhardt,  1974,  where  this  is  stated  on  p.  232  as 
the  probability  of  switching  from  i  to  j  depends  only  on  the  purchase 
probabilities  p^  and  pj  (which  are  equivalent  to  market  shares)." 


and  with  R*  regarded  as  a  switching  constant,  this  Is  reflected  in  (7). 

Although  this  empirical  "law"  has  been  regarded  as  implying  that  one 
has  switching  equilibrium  In  a  zero  order  process,  there  are  other  less 
restrictive  assumptions  which  can  guarantee  such  observations  empirically. 

For  example,  suppose  that  the  underlying  distribution  which  yields  these 
switches  is  multinomial  in  character  with  probabilities  p^,  i  *  1,  2,  ...»  n. 
It  then  follows  from  the  multivariate  form  of  the  central  limit  theorem 
that  the  observations  tend  to  the  multi-normal  distribution  whose  covariance 
matrix  is  symmetric  with  off-diagonal  terms  proportional  to  the  product  of 
the  corresponding  brand  share. Thus,  for  large  samples,  one  should  then 
expect  to  find  "brand  switching  proportional  to  the  product  of  brand  shares." 
Conversely,  a  finding  of  statistically  significant  deviations  will  result 
in  rejection  of  the  empirical  "law"  that  switching  is  proportional  to  brand 
share. 


6.  ALGORITHM  AND  TESTING  PROCEDURE 

The  results  provided  in  (6)  allowed  us  to  reduce  the  number  of  brand 
combinations  to  be  considered  in  a  market  segmentation  analysis.  This  is 
important  also  when  we  come  to  effecting  our  statistical  tests  since  one 

A  /N 

might  (in  principle)  have  to  consider  a  great  number  of  possible  R^,  Rj 
pairs  to  test  in  (15). 


X/For  a  more  detailed  discussion  with  accompanying  further  references, 
~  see  Charnes,  Cooper,  Learner  and  Phillips,  1980a. 
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We  therefore  suggest  the  following  algorithm  which  is  based  on  (7) 
in  section  3: 


Order  the  R^  values  from  smallest  to  greatest  as 
R(l)»  r(2)’  •••»  R(n)’  as  in  Table  3  below.  See  (8).  By 
MDI,  test  the  hypothesis  that  R^  and  R^j  are  equal  at  a 
specified  level  of  significance.  If  the  resulting  value  of 
the  MDI  statistic  rejects  the  hypothesized  equality  then  seg¬ 
ment  R^j  from  r(2)*  Then  begin  with  R^j  as  a  smallest  R^ 
to  test  for  a  further  segment  apart  from  6(3) •  If.  instead, 
the  test  accepts  the  hypothesized  equality  of  R^  and  R(2)» 
add  the  additional  condition  to  R^)  =  R(3)*  Test  with 
the  MDI  statistic.  If  the  test  accepts  the  hypothesized  equality 
0f  R(2)  and  R^3)  go  on  to  new  higher  R^  by  adding  the  new 
pertinent  additional  equality  conditions. 

To  illustrate  our  suggested  procedures  for  using  these  MDI  values, 
we  now  refer  to  the  following  Ri  values  from  the  results  portrayed  in  (8), 
but  arranged  as  indicated  by  our  algorithm. 


TABLE  3 


i : 

2 

1 

6 

4 

3 

5 

V 

.656 

.660 

.704 

.719 

.723 

.750 

Applying  this  algorithm  we  test  the  hypotheses  indicated  by  the 
nestings  (from  top  to  bottom)  under  the  column  labelled  "Null  Hypothesis" 
in  Table  4.  With  this  formulation,  Ehrenberg  and  Goodhardt  apparently 
are  correct,  or  at  least  their  "one  homogeneous  market"  conclusion  can¬ 
not  be  rejected  on  the  basis  of  these  data  for  all  sample  sizes,  N  5  6,500 
and  significance  level  a  =  0.95. 


TABLE  4 


Problem 

Null 

Hypothesis 

I* 

d.f  .# 

N  <  6500 

a  =  0.95 

1 

R2  = 

"i 

Negligible 

1 

accept 

2 

*2  = 

R1  ’  R6 

1.24  x  10‘4 

2 

accept 

•  3 

R2  ■ 

R1  3  R6  =  R4 

3.60  x  10-4 

3 

accept 

4 

R2  = 

R1  =  R6  =  R4  =  R3' 

5.94  x  10‘4 

4 

accept 

5 

R2  ■ 

R1  3  R6  3  R4  3  R3  =  R5 

8.54  x  10“4 

5 

accept. 

#The  degrees  of  freedom  are  generally  equal  to  the  number  of  linearly  independent 
constraints— with  a  rigorous  derivation  relating  them  to  the  number  of  parameters 
in  the  dual  system.  See.  e.g.,  pp.  38-39  and  181  ff.  in  Gokhale  and  Kullback, 
1978a.  See  also  Gokhale  and  Kullback  1978b. 


In  this  case  we  have  not  rejected  the  hypothesis  of  no  market  segments. 

In  the  opposite  case  we  would  simply  push  to  a  next  level  of  analysis  by 
similarly  testing  for  equality  of  the  vulnerability  ratio  within  each  segment 
using  multiple  subscripted  R^s.  Instead  of  testing  such  hypothesized  segments 
one  at  a  time,  we  could  simultaneously  test  for  equality  (e.g.,  equality  with 
different  switching  constants)  in  each  hypothesized  segment  while  refraining 
from  imposing  similar  constraints  on  other  parts  of  the  jnarket. 

We  can  also  test  for  market  homogeneity  in  other  ways.  For  instance  we 
can  test  for  market  homogeneity  based  on  requiring  the  estimated  p^j  to  conform 
to  the  conditions  for  switching  equilibrium.  This  requires  the  adjunction  of 
additional  constraints  as  in  the  following: 
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suoject  to 


min  I  ( p : ir ) 


(19) 


6  6 

z  z 

1*1  j*l 


pij 

p1j 


Vi  t  j 

Vi.j  . 


The  resulting  estimates  are  portrayed  in  Table  5  with  I*  =  0.007  and  15 
degrees  of  freedom.  The  hypothesis  of  "switching  equilibrium"  is  not  rejected 
for  all  N  £  1430  at  a  *  0.95.—^ 

TABLE  5 


★ 

p.  •  Switching  Equilibrium  Values  ,  i  f  j 


>4. 

1 

2 

_ 3 

4_ 

5 

6 

1 

.240 

.071 

.035 

.025 

.015 

.012 

2 

.071 

.144 

.033 

.013 

.007 

.007 

3 

.035 

.033 

.054 

.009 

.006 

.007 

4 

.025 

.013 

.009 

.031 

.007 

.003 

5 

.015 

.007 

.006 

.007 

.015 

.002 

6 

.012 

.007 

.007 

.003 

.002 

.015 

-^Because  we  are  using  these  data  only  for  illustration  we  do  not  adjust 
these  a  values  in  these  successive  tests. 
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Apparently  the  market  portrayed  in  Table  1  is  consistent  with  the 
hypothesized  switching  equilibrium  and  the  p.*  values  portrayed  in  Table  5 

*  J 

represent  "best"  estimates  of  the  switching  probabilities  in  the  sense  that 
they  are  "as  close  as  it  is  possible  to  get"  to  the  n..  values  in  Table  2 

*  J 

while  retaining  the  symmetry  conditions  for  switching  equilibrium  as 
reflected  in  (19). 

Turning  next  to  the  question  of  market-share  equilibrium  we  now 
formulate  our  model  for  purposes  of  testing  this  hypothesis'  by  incorporating 
these  conditions  explicitly  as  follows:-^ 

min  I  (p:m) 


subject  to 


(17) 


Z  P1j  =  P1 
j=l  1 


I  P,-<  s  P 


i=l 


ij  rJ 


A  A 

Ri 


A  A 

Rk  ■  Re 


V'.J 


where  p.  and  p.  represent  the  average  of  the  rim  values  from  Table  2  with 
_  1  J  6  _  6  _ 

p.  =  p.  whenever  i=j  and  Ip.  =  [  p.  =  1.  In  other  words,  we  formalize  the 

J  i =1  j-1  J 

assumption  of  an  hypothesized  market  share  equilibrium  and  then  repeat  the 


—  Compare  with  (14)  and  (15).  Although  Ehrenberg  and  Goodhardt  are 
clear  in  their  discussion,  no  such  tests  were  conducted  by  them  —  possibly 
because  classical  statistical  mechanisms  are  not  designed  to  deal  with  such 
explicitly  constrained  models  involving  "external  constraints"  in  rather 
'omplex  arrays. 


same  tests  as  in  (14)  to  ascertain  whether  the  conclusion  of  a  single 
homogeneous  market  is  correct  and  obtain  the  results  shown  in  Table  6. 

TABLE  6 

MARKET  SEGMENTATION  WITH 
MARKET  SHARE  EQUILIBRIUM 


Problem 

Null  Hypothesis 

I* 

d.f. 

N=3750 

a=Q.95 

1 

R1=R2 

2. 78x1 0"3 

12 

accept 

2 

R1=R2=R6 

3.Q2xlO~3 

13 

reject 

3 

R1=R2  ;  R6sR4 

2.88xl0-3 

13 

accept 

4 

R1=R2  ;  R6=R4=R3 

2.89xl0"3 

14 

accept 

5 

R1=R2  S  R6=R4=R3=R5 

2.99xl0'3 

15 

accept 

For  N  <  3,750  the  Ehrenberg-Goodhardt  conclusion  of  a  single  homogeneous 
market  continues  to  be  maintained  as  before.  For  N  =  3,750  the  hypothesis  of 
market  homogeneity  would  be  rejected  since  at  this  sample  size  (and  above) 
the  market  segments  into  two.  Thus  at  these  sample  sizes  the  market 
stationarity  which  Ehrenberg  and  Goodhardt  assumed  in  their  averaging  is  not 
sustai ned. 

★ 

We  also  provide  the  p.  •  estimates  corresponding  to  the  hypothesis  of 

i  J 

problem  5  in  table  6.  These  are  presented  in  Table  7.  Note  that  these  values 
are  consistent  with  the  indicated  segmentation  in  N  =  3,750.  At  this  sample 
size,  the  results  in  Tables  4  and  6  are  consistent  and  at  N  <_  1,430  they  are 
also  consistent  with  the  concept  of  switching  equilibrium.  At  sample  sizes 
larger  then  N  =  1,430,  however,  the  hypothesis  of  switching  equilibrium  is 
rejected  while  the  hypothesis  of  market  -  share  equilibrium 
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continues  to  be  maintained  for  1,430  <  N  <  3,750.  Evidently,  the  two  concepts 
are  not  the  same.  Thus,  even  with  an  hypothesized  and  validated  market-share 
equilibrium  the  vulnerability  ratio  continues  to  provide  valuable  information 
by  the  way  it  summarizes  the  net  switches  in  and  out  and  by  the  way  it 
designates  the  more  vulnerable  products  by  reference  to  repeat  buying 
relative  to  market  share. 


TABLE  7 

ESTIMATES  OF  SWITCHING/REPEAT  PROBABILITIES  IN  A  TWO-SEGMENT 
MARKET  UNDER  MARKET-SHARE  EQUILIBRIUM 


B 

1 

2 

3 

4 

5 

6 

1 

i 

1 

i 

.239 

.076 

.033 

.024 

.012 

- 1 

.012 

! 

i  2 

.065 

.143 

.034 

.014 

.007 

.010 

3 

.035 

.031 

.054 

.010 

.007 

.006 

1 

j  4 

.028 

.013 

.010 

.031 

.006 

.002 

1  5 

.017 

.006 

.004 

.007 

.016 

.001 

!  6 

1 

.012 

.005 

.007 

.004 

.003 

.014 

i 


1  / 

-See  the  discussion  in  Section  4. 
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Supposing  that  the  sample  size  is  adequate,  the  estimates  in  Table  7 
provide  further  insight  into  the  Table  6  results.  Evidently  the  switching 
probabilities  between  brands  1  and  2  are  high  relative  to  the  rest  of  the 
market.  Moreover,  the  repeat-purchase  probabilities  for  each  of  brands 
1  and  2  are  also  relatively  high. 

Presumably  this  finding  would  next  be  expanded  in  terms  of  further 
analyses  such  as  whether  this  segment  exhibited  brand-primary  or  form¬ 
primary  characteristics,  etc.-^  Our  interpretation  of  the  values  as 
"vulnerability  ratios"  can  also  be  used  to  yield  conclusions  of  possible 
managerial  significance  which  we  can  illustrate  with  the  results  in 
Tables  6  and  7.  Thus,  via  the  results  in  Table  6,  the  marketing 
managers  of  brands  1  and  2  might  be  advised  to  turn  their  attention  to 
the  segment  identified  via  Rg  =  ^4  =  =  ^5*  As  seen  Table  7,  the 

repeat  purchase  probabilities  in  this  segment  are  all  relatively  low. 

The  switching  probabilities  between  the  brands  in  this  segment  are  also 
relatively  low  and  the  fact  that  they  are  all  classified  together  by  our 
statistical  tests  means  that  the  marketing  managers  of  brands  1  ar.d  2 
should  orient  their  strategy  considerations  to  the  whole  (equally 
vulnerable)  segment.  To  state  this  last  point  somewhat  differently,  if 
our  statistical  tests  had  identified  a  further  decomposition  then  the 
marketing  managers  of  brands  1  and  2  would  be  better  advised  to  consider 
the  different  vulnerability  values  and  the  strategies  suitable  to  taking 
advantage  of  them. 
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CONCLUDING  REMARKS 


Our  illustrations  have  been  confined  to  the  case  of  external  constraints. 
Examples  using  only  internal  constraints  (which  are  necessarily  consistent  with 
the  observations)-^  could  also  be  employed.  Combinations  of  internal  and 
external  constraints  are  possible,  of  course,  but  these  involve  more  recondite 
analyses  since  a  general  theory  for  their  treatment  is  not  yet  available. 

Our  illustrations  also  involve  only  linear  equality  constraints  where, 
again,  the  theory  is  fully  developed  and  readily  available.  More  general 
non-linear  relations  and  even  inequality  constraints  are  only  now  beginning 
to  be  addressed.—/ 

Even  while  allowing  for  these  limitations,  a  great  variety  of  other 
possibilities  are  also  present  besides  those  illustrated  in  this  article, 
and  the  accompanying  algorithm  and  models  we  have  provided  should  help  to 
provide  at  least  a  start  for  the  use  of  such  MDI  methods  in  marketing. 

This  includes  possibilities  for  further  testing  of  sharpened  hypotheses  when 
one  is  willing  to  make  more  specialized  assumptions  about  the  statistical 
distributions  and/or  (as  in  Sabavala  and  Morrison,  1977)  the  underlying 
probability  models  governing  the  behavior  of  individual  consumers.  It 
also  opens  contacts  with  mathematical  programming  (and  its  duality  relations) 
which  have  proved  of  great  value  in  other  types  of  modelling  not  only  for 
the  computational  power  that  is  provided  but  also  for  a  great  variety  of 
managerial  uses  and  interpretations  including  those  of  a  policy  evaluation 
variety.  The  way  is  opened  for  using  such  models  with  their  explicitly 

1/See  p.  181  in  Gokhale  and  Kullback,  1978a. 

2/See  Charnes,  Cooper  and  Tyssedal  (forthcoming)  for  a  recent  extension 
of  the  duality  relations  described  in  Charnes,  Cooper  and  Seiford,  1978, 
to  comprehend  situations  involving  inequality  constraints. 
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formulated  constraints  to  reflect  a  variety  of  managerial  policy  conditions 
(e.g.,  on  market  share,  etc.)  which  can  then  be  tested  and  evaluated. 
Proposed  policies  and  marketing  activities  as  reflected  in  external 
constraints  can  then  be  related  to  costs  and  benefits  flowing  from  their 
imposition  or  accomplishment.  The  ultimate  managerial  benefit  is  that 
approaches  along  these  lines  can  serve  to  direct  management  to  activities 
that  ought  to  be  considered  and  to  provide  a  way  of  evaluating  various 
mixtures  of  policies  relative  to  their  possibilities  for  realization  in 
the  market. 
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